Blind Recovery of Perceptual Models in Distributed Speech and Audio Coding
نویسندگان
چکیده
A central part of speech and audio codecs are their perceptual models, which describe the relative perceptual importance of errors in different elements of the signal representation. In practice, the perceptual models consists of signal-dependent weighting factors which are used in quantization of each element. For optimal performance, we would like to use the same perceptual model at the decoder. While the perceptual model is signal-dependent, however, it is not known in advance at the decoder, whereby audio codecs generally transmit this model explicitly, at the cost of increased bit-consumption. In this work we present an alternative method which recovers the perceptual model at the decoder from the transmitted signal without any side-information. The approach will be especially useful in distributed sensor-networks and the Internet of things, where the added cost on bit-consumption from transmitting a perceptual model increases with the number of sensors.
منابع مشابه
Blind Tamper Detection in Audio using Chirp based Robust Watermarking
In this paper, we propose the use of ‘chirp coding’ for embedding a watermark in audio data without generating any perceptual degradation of audio quality. A binary sequence (the watermark) is derived using energy based features from the audio signal and chirp coding used to embed the watermark in audio data. The chirp coding technique is such that the same watermark can be derived from the ori...
متن کاملL2 Learners’ Lexical Inferencing: Perceptual Learning Style Preferences, Strategy Use, Density of Text, and Parts of Speech as Possible Predictors
This study was intended first to categorize the L2 learners in terms of their learning style preferences and second to investigate if their learning preferences are related to lexical inferencing. Moreover, strategies used for lexical inferencing and text related issues of text density and parts of speech were studied to determine their moderating effects and the best predictors of lexical infe...
متن کاملA warped linear-prediction-based subband audio coding algorithm
In this paper, a novel audio coding algorithm is proposed where the warped linear prediction (WLP) technique is employed to construct a perceptual preand post-filter for subband audio coding. A modified signal-to-mask ratio (SMR) calculation is given for subband coding of the WLP residuals of audio signals. The concept of perceptual entropy (PE) is extended to subband coding, resulting in the s...
متن کاملA Qualitative Meta-analysis of Perceptual-motor Problems in Visually Impaired People
Introduction: Perceptual motor activities improve motor skills and learning. These skills play an effective role in receiving, interpreting and responding to the sensory stimuli. This study aimed to identify perceptual-motor problems in visually impaired people. Methods: This qualitative research was conducted using a research synthesis method. Therefore, the analysis unit consisted of all the...
متن کاملWideband Speech Recovery Using Psychoacoustic Criteria
Manymodern speech bandwidth extension techniques predict the high-frequency band based on features extracted from the lower band.While this method works for certain types of speech, problems arise when the correlation between the low and the high bands is not sufficient for adequate prediction. These situations require that additional high-band information is sent to the decoder. This overhead ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016